Statistical Treatment of Rain Gauge 
Calibration Data 

By MRS. A. E. FREENY 

(Manuscript received December 2, 1968) 

This paper describes the statistical treatment of the calibration data of 
the capacitance gauges used for measurement of rain rates in the rain gauge 
network set up in a 160 square kilometer area surrounding Crawford Hill, 
Holmdel, New Jersey. The expression R = ae~ pt , where R is the rain 
rate in millimeters per hour, f is the frequency of the rain gauge oscillator, 
and a and /3 are fitted coefficients, allows calibration curves to be developed 
for all 99 of the devices in the system with an overall equivalent standard 
error of 7.3 millimeters per hour. This paper discusses the distribution of 
parameters and residuals and gives a refinement which corrects for a 
fitting bias. 

I. INTRODUCTION 

This paper describes the statistical treatment of the calibration data 
of capacitance gauges used for measurement of rain rate in the rain 
gauge network set up in a 160 square km area surrounding Crawford 
Hill, Holmdel, New Jersey. Section II describes the original calibra- 
tion data plus added observations, Section III discusses the least 
squares fits to the individual oscillators and their results, and Section 
IV describes the correction made for bias. 

II. CALIBRATION DATA 

The calibration data consisted of approximately 20 pairs of readings 
for each of the 100 gauges. 1 ' 2 These readings, which consist of fre- 
quency in KHz and rain rate in mm per hour, were obtained by pour- 
ing water via a flowmeter through the gauge at a given rate and 
recording the corresponding oscillator frequency reading. The data 
used includes the original calibration readings plus supplementary 
readings made in the spring of 1967 when a field check on the original 
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Fig. 1 — Rain rate versus frequency, oscillator 47. 

calibrations was made. For oscillator 47, which shows a typical con- 
figuration, rain rate R is plotted against frequency / on both a linear 
and a semilog scale in Fig. 1. Points plotted with a "•" are the original 
calibration measurements; those with a "o" are added measurements. 



III. EXPONENTIAL FITS TO INDIVIDUAL OSCILLATORS 

The linear character of the semilog plots show that R = ae~^ f , that is, 
a simple exponential curve, is a reasonable description of the data. 
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After some consideration the exponential form, rather than the linear 
alternative In R = In a — /?/, was chosen so that the residuals would 
be equally weighted over the frequency range. This minimizes the 
relative errors at high rain rates. The data for each of the oscillators 
were put through a nonlinear least squares fitting program; estimates 
a and b of the parameters a and /3 were obtained. Figure 2a shows the 
fit of the data to oscillator 47. 

For some of the oscillators, it was clear from the additional readings 
that the calibration at this time was not the same as it had been at the 
time the original measurements were made. For these oscillators, the 
original data were not used and fitting was done on the supplementary 
data only. Two of these cases were left with only the three additional 
points, and one had only five points, so the quality of the estimates for 
these oscillators is poor. In all, six oscillators had less than ten points. 

Inspection of the residuals from the fits for the individual oscil- 
lators showed very good agreement between the calculated values and 
the observed values in most cases; however, a significant number of 
the oscillators showed two types of nonrandom scatter about the 
fitted curve. 

The first type is a pronounced lack of fit of the exponential curve to 
low rain rate data (R ^ 30 mm per hour) for oscillators 13, 21, and 35. 
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Fig. 2 — Exponential fits: (a) oscillator 47 [R = 883.84 exp(— 0.516/)] ; 
(b) oscillator 76 [It = 849.86 exp( -0.534/)]. 
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Fig. 3 — Normal probability plots of estimates from exponential fits to each 
oscillator: (a) alpha, (b) beta. 
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The fitted curve overestimates the rain rate by as much as 20 mm per 
hour at very low rates; but the overestimate disappears by about 30 
mm per hour. 

The second type of scatter is a tendency of the observed rain rate 
to lie below the curve for 4 <; / < 5 KHz, and above the curve for 5 
<, f < 9 KHz for gome oscillators. Figure 2b shows the fit of the curve 
to the data from oscillator 76 which follows this pattern. Several other 
straightforward functions were tested on a few of these oscillators but 
did not result in improvement of the residual patterns. However, the 
simple exponential equation expresses the relationship between rain 
rate and frequency quite well. 

Of the 100 oscillators, 89 had an error mean square of less than 
100, and 11 had an error mean square greater than 100. Inspection of 
the individual fits to the oscillators indicated that there was an outlier 
in the data of the oscillators having two of the 11 worst fits. These ob- 
servations were removed and new fits were calculated. These have error 
mean squares less than 100. Of the remaining nine worst fits, one was 
to oscillator 56 which was not being used in the rain gauge network, 
so the data from this oscillator were excluded from further calcula- 
tions. After removal of this data, the average value of the error mean 
square is 57.75. The mean square of the pooled residuals is 53.55, which 
is equivalent to a standard error of about 7.3. 

An informal assessment of the distribution of a and (3 may be made 
from normal probability plots of a and 6 which show the values of 
a and b plotted against standard normal quantiles for a sample of size 
99. 3 These plots are shown in Fig. 3. The plot of a shows an upper tail 
which is too long to be normally distributed. The plot of b has both 
tails too short for a normal distribution. It is also possible to interpret 
these plots as showing a slight noticeable curvature upward for a and 
downward for b. This suggests that the estimates from the equation 
in the form In R = A — (3J would have been more nearly normally 
distributed. 

Figure 4 is a normal plot of the pooled residuals for all oscillators. 
The outstanding feature of this plot is a change in the slope toward the 
upper tail of the distribution which could be produced by the residuals 
being from different distributions (hopefully of the same shape but 
with different parameters). This could occur if the oscillators were not 
all from the same population. 

The plot of residuals versus rain rate (Fig. 5a) shows the tendency 
of the residuals to be positive for 10 < 7? ^ 30 mm per hour, nega- 
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Fig. 4 — Normal probability plot of residuals from exponential fits. 

tive for 50 <> R <> 100 mm per hour, and positive again for 120 
<; R < 200 mm per hour. This agrees roughly with the tendency 
toward nonrandom scatter of the residuals about the individual 
exponential fits and is not unexpected. 

The plot of residuals versus frequency (Fig. 5b) shows the non- 
homogeneity of fit over the frequency range of the oscillators, that is, 
that the spread of the residuals about the fitted curve is not the same 
for all values of the frequency. The plot also demonstrates the tend- 
ency of the residuals toward oscillation about 0. 



[V. CORRECTION FOR CALIBRATION BIAS 

It was decided to try to remove the calibration bias shown as a 
cyclic trend in Fig. 5a. Accordingly a function of the form 

j*i = A sin (aR p -4- <p), 
where r, is the residual from the exponential fit and R is the associated 
rain rate, was fit to the pooled residuals. 
The resulting equation is 

r, = 3.813 sin (1.450/?" 3894 - 2.5fi0) 
with an error mean square of 46.33. 
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Fig. 5 — Residuals from exponential fits versus (a) rain rate and (b) frequency 
for all oscillators. 
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Fig. 6 — Sine fit of residuals from exponential fita to rain rate. 

Figure 6 plots this fit and shows half of the residuals selected at 
random. The residuals from this fit (r 2 ) are plotted against rain rate 
in Fig. 7a and against frequency in Fig. 7b. 

Figure 7 shows that the cyclic trend has indeed been removed by 
the sine function fit leaving a more random scatter of the residuals r 2 . 
The spread of these residuals is different at different values of rain 
rate and frequency as was the spread of the residuals rj.. A normal 
plot of the residuals r 2 looks substantially like that of Fig. 4, with 
the same change in slope toward the upper tail, and somewhat more 
curvature in the extreme tails. 



V. CONCLUSIONS 

A sine corrected exponential fit to this calibration data produces a 
pooled standard error of less than 7 mm per hour in the rain rate. This 
is not very much better than the pooled standard error of about 7.3 
mm per hour associated with the exponential fits alone. However, the 
sine correction has removed the average bias present in the exponential 
fits and will produce a small average error in the rain rates between 
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Fig. 7 — Residuals from sine fit versus (a) rain rate and (b) frequency. 



10 and 100 mm per hour which is where the majority of the data of 
interest lies. The normal plots of the pooled residuals from the ex- 
ponential fits and the residuals from the sine corrected fit are suf- 
ficiently similar so as not to give preference to either the plain or 
corrected exponential fit. 
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In processing the rainfall data collected by this network in 1967, 
the individual exponential coefficients for each gauge were used to 
calculate rain rates from the recorded frequencies. The sine correction 
was then applied to the rain rates from all the gauges. If the correction 
produced a negative value, was substituted. No further corrections 
were made to the fits to the three gauges which showed overestimation 
of rain rates ^ 30 mm per hour. These rain-rate values were recorded 
for future analysis. 
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